Recherche | OMS COVID-19 - Base de données de recherche

Language Models for the Prediction of SARS-CoV-2 Inhibitors (preprint)

Andrew E Blanchard; John Gounley; Debsindhu Bhowmik; Mayanka Chandra Shekar; Isaac Lyngaas; Shang Gao; Junqi Yin; Aristeidis Tsaris; Feiyi Wang; Jens Glaser.

biorxiv; 2021.

Preprint Dans Anglais | bioRxiv | ID: ppzbmed-10.1101.2021.12.10.471928

Résumé

The COVID-19 pandemic highlights the need for computational tools to automate and accelerate drug design for novel protein targets. We leverage deep learning language models to generate and score drug candidates based on predicted protein binding affinity. We pre-trained a deep learning language model (BERT) on ~9.6 billion molecules and achieved peak performance of 603 petaflops in mixed precision. Our work reduces pre-training time from days to hours, compared to previous efforts with this architecture, while also increasing the dataset size by nearly an order of magnitude. For scoring, we fine-tuned the language model using an assembled set of thousands of protein targets with binding affinity data and searched for inhibitors of specific protein targets, SARS-CoV-2 Mpro and PLpro. We utilized a genetic algorithm approach for finding optimal candidates using the generation and scoring capabilities of the language model. Our generalizable models accelerate the identification of inhibitors for emerging therapeutic targets.

Sujets)

COVID-19 , Troubles du langage

Intelligent Resolution: Integrating Cryo-EM with AI-driven Multi-resolution Simulations to Observe the SARS-CoV-2 Replication-Transcription Machinery in Action (preprint)

Anda Trifan; Defne Gorgun; Zongyi Li; Alexander Brace; Maxim Zvyagin; Heng Ma; Austin R Clyde; David A Clark; Michael Salim; David Hardy; Tom Burnley; Lei Huang; John McCalpin; Murali Emani; Hyunseung Yoo; Junqi Yin; Aristeidis Tsaris; Vishal Subbiah; Jessica Liu; Noah Trebesch; Geoffrey Wells; Venkatesh Mysore; Tom Gibbs; James Phillips; S. Chakra Chennubhotla; Ian Foster; Rick Stevens; Anima Anandkumar; Venkatram Vishwanath; John E. Stone; Emad Tajkhorshid; Sarah A. Harris; Arvind Ramanathan.

biorxiv; 2021.

Preprint Dans Anglais | bioRxiv | ID: ppzbmed-10.1101.2021.10.09.463779

Résumé

The severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) replication transcription complex (RTC) is a multi-domain protein responsible for replicating and transcribing the viral mRNA inside a human cell. Attacking RTC function with pharmaceutical compounds is a pathway to treating COVID-19. Conventional tools, e.g., cryo-electron microscopy and all-atom molecular dynamics (AAMD), do not provide sufficiently high resolution or timescale to capture important dynamics of this molecular machine. Consequently, we develop an innovative workflow that bridges the gap between these resolutions, using mesoscale fluctuating finite element analysis (FFEA) continuum simulations and a hierarchy of AI-methods that continually learn and infer features for maintaining consistency between AAMD and FFEA simulations. We leverage a multi-site distributed workflow manager to orchestrate AI, FFEA, and AAMD jobs, providing optimal resource utilization across HPC centers. Our study provides unprecedented access to study the SARS-CoV-2 RTC machinery, while providing general capability for AI-enabled multi-resolution simulations at scale.

Sujets)

Infections à coronavirus , COVID-19

IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads (preprint)

Aymen Al Saadi; Dario Alfe; Yadu Babuji; Agastya Bhati; Ben Blaiszik; Thomas Brettin; Kyle Chard; Ryan Chard; Peter Coveney; Anda Trifan; Alex Brace; Austin Clyde; Ian Foster; Tom Gibbs; Shantenu Jha; Kristopher Keipert; Thorsten Kurth; Dieter Kranzlmüller; Hyungro Lee; Zhuozhao Li; Heng Ma; Andre Merzky; Gerald Mathias; Alexander Partin; Junqi Yin; Arvind Ramanathan; Ashka Shah; Abraham Stern; Rick Stevens; Li Tan; Mikhail Titov; Aristeidis Tsaris; Matteo Turilli; Huub Van Dam; Shunzhou Wan; David Wifling.

arxiv; 2020.

Preprint Dans Anglais | PREPRINT-ARXIV | ID: ppzbmed-2010.06574v1

Résumé

The drug discovery process currently employed in the pharmaceutical industry typically requires about 10 years and $2-3 billion to deliver one new drug. This is both too expensive and too slow, especially in emergencies like the COVID-19 pandemic. In silicomethodologies need to be improved to better select lead compounds that can proceed to later stages of the drug discovery protocol accelerating the entire process. No single methodological approach can achieve the necessary accuracy with required efficiency. Here we describe multiple algorithmic innovations to overcome this fundamental limitation, development and deployment of computational infrastructure at scale integrates multiple artificial intelligence and simulation-based approaches. Three measures of performance are:(i) throughput, the number of ligands per unit time; (ii) scientific performance, the number of effective ligands sampled per unit time and (iii) peak performance, in flop/s. The capabilities outlined here have been used in production for several months as the workhorse of the computational infrastructure to support the capabilities of the US-DOE National Virtual Biotechnology Laboratory in combination with resources from the EU Centre of Excellence in Computational Biomedicine.

Sujets)

COVID-19

Résumé

Sujets)

Résumé

Sujets)

Résumé

Sujets)

ENVOYER À:

SÉLECTION CITATIONS

Détails de la recherche